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Abstract 

Background: The monkey malaria parasite Plasmodium knowlesi is now recognized as the fifth species of 
Plasmodium that can cause human malaria. Like the region II of the Duffy binding protein of P. vivax (PvDBPII), the 
region II of the P. knowlesi Duffy binding protein (PkDBPall) plays an essential role in the parasite's invasion into the 
host's erythrocyte. Numerous polymorphism studies have been carried out on PvDBPII, but none has been reported 
on PkDBPall. In this study, the genetic diversity, haplotyes and allele groups of PkDBPall of P. knowlesi clinical 
isolates from Peninsular Malaysia were investigated. 

Methods: Blood samples from 20 knowlesi malaria patients and 2 wild monkeys {Macaca fascicularis) were used. 
These samples were collected between 2010 and 2012. The PkDBPall region of the isolates was amplified by PCR, 
cloned into Escherichia coli, and sequenced. The genetic diversity, natural selection and haplotypes of PkDBPall 
were analysed using MEGA5 and DnaSP ver. 5.10.00 programmes. 

Results: Fifty-three PkDBPall sequences from human infections and 6 from monkeys were obtained. Comparison at the 
nucleotide level against P. knowlesi strain H as reference sequence showed 52 synonymous and 76 nonsynonymous 
mutations. Analysis on the rate of these mutations indicated that PkDBPall was under purifying (negative) selection. At 
the amino acid level, 36 different PkDBPall haplotypes were identified. Twelve of the 20 human and 1 monkey blood 
samples had mixed haplotype infections. These haplotypes were clustered into 2 distinct allele groups. The majority of 
the haplotypes clustered into the large dominant group. 

Conclusions: Our present study is the first to report the genetic diversity and natural selection of PkDBPall. Hence, the 
haplotypes described in this report can be considered as novel. Although a high level of genetic diversity was observed, 
the PkDBPall appeared to be under purifying selection. The distribution of the haplotypes was skewed, with one 
dominant major and one minor group. Future study should investigate PkDBPall of P. knowlesi from Borneo, which 
hitherto has recorded the highest number of human knowlesi malaria. 
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Background 

Malaria is caused by blood protozoa of the genus Plasmo- 
dium. Historically, four species of Plasmodium are known 
to cause human malaria: Plasmodium falciparum, P. vivax, 
P. malariae, and P. ovale. However, about a decade ago, P. 
knowlesi, a malaria parasite of macaque monkeys, was 
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reported to cause a large number of human infections in 
Sarawak, Borneo Island [1]. Since then, human knowlesi 
malaria has been reported in other parts of Borneo Island, 
Peninsular Malaysia, and in many other countries in 
Southeast Asia [2], Imported knowlesi malaria cases have 
been reported in European countries and Japan due to eco- 
tourism programs to the forested areas of Southeast Asia. 
Now, P. knowlesi is recognised as the fifth species of 
Plasmodium that can cause human malaria. 

Invasion of a malaria parasite into its host erythrocyte 
depends on the interaction between the parasites 
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protein and the corresponding receptor on the surface 
of the erythrocyte. Plasmodium vivax and P. knowlesi 
use the Duffy blood group antigen as a receptor to in- 
vade erythrocytes [3]. The Duffy binding proteins of P. 
vivax (PvDBP) and P. knowlesi (PkDBP) are located on 
their merozoites. PvDBP and PkDBP are members of the 
erythrocyte-binding protein family which also includes 
the P. falciparum EBA-175 [4]. PvDBP and PkDBP are 
large proteins and can be divided into seven regions 
(I- VII). Region II contains the critical motifs for binding 
to the erythrocyte Duffy antigen. 

PkDBP is encoded by an a-gene and therefore is more 
specifically known as PkDBPa. This is to distinguish it from 
two other highly homologous proteins in P. knowlesi - |3 
and y. Region II of the p and y proteins have different bind- 
ing specificities compared to PkDBPa. Region II of the p 
and y proteins binds to rhesus erythrocytes but does not 
bind to Duffy antigen of human erythrocytes [5]. 

PvDBP has been suggested to be an important vaccine 
candidate antigen against vivax malaria because it elicits 
strong immune responses in humans. Design of vaccine 
against vivax malaria must take into consideration the 
nature and genetic polymorphism of PvDBP. Region II 
of PvDBP (denoted as PvDBPII) in P. vivax isolates from 
different geographical regions such as Colombia, South 
Korea, Papua New Guinea, Thailand, Iran and Myanmar 
has been shown to be highly polymorphic, and numer- 
ous PvDBPII haplotypes and allele groups have been 
identified [6-11]. 

It has been observed that antibodies raised against 
PkDBPall could inhibit P. knowlesi invasion of human 
and rhesus erythrocytes in vitro [12]. Therefore, like 
PvDBPII for vivax malaria, PkDBPall may also be a can- 
didate vaccine antigen against knowlesi malaria. Whilst 
many polymorphism studies on PvDBPII have been car- 
ried out, hitherto, none has been reported for PkDBPall. 
In this report, we present our findings on genetic diver- 
sity, haplotyes and allele groups of PkDBPall in P. know- 
lesi clinical isolates from Peninsular Malaysia. 

Methods 

Blood samples 

Human blood samples used in this study were collected 
from 20 patients who were infected with Plasmodium 
knowlesi at the University of Malaya Medical Centre and 
several private clinics in Peninsular Malaysia. Two blood 
samples from P. knowlesi-infected wild monkeys (Macaca 
fascicularis) were provided by the Wildlife Department of 
Federal Territory, Peninsular Malaysia. These blood sam- 
ples were collected between 2010 and 2012 from various 
states in Peninsular Malaysia (Table 1). Confirmation of 
P. knowlesi infection in all the samples was carried out by 
microscopic examination of Giemsa-stained thin and thick 
blood smears and polymerase chain reaction. Ethical 



Table 1 Date and origin of human and monkey blood 
samples used in this study 



DlOOU 

sample 


Date 
(Month, year) 


Origin 

(State in Peninsular Malaysia) 


ANU 


Feb 2012 


Kelantan 


AZL 


Jun 2011 


Perak 


CHO 


Oct 2010 


Pahang 


GAN 


Oct 201 1 


Selangor 


HAI 


Feb 2012 


Kelantan 


HAN 


Feb 2012 


Kelantan 


HEN 


Feb 2012 


Kelantan 


IZA 


Aug 2010 


Selangor 


JUN 


Feb 2012 


Kelantan 


MAD 


Aug 2010 


Negeri Sembilan 


MAH 


Aug 2011 


Selangor 


MEL 


Feb 2012 


Kelantan 


NGO 


Oct 2010 


Perak 


OTH 


Feb 2012 


Kelantan 


RAU 


Feb 2012 


Kelantan 


SUP 


Feb 2012 


Kelantan 


SYA 


Feb 2012 


Kelantan 


UM0001 


Jan 2012 


Selangor 


UM0002 


Feb 2012 


Kelantan 


UM0004 


Apr 2012 


Selangor 


Monkey566 


Oct 2010 


Negeri Sembilan 


Monkey569 


Oct 2010 


Pahang 



approvals for the use of human and monkey blood samples 
in this study were granted by the University of Malaya 
Medical Centre Ethic Committee and (MEC No. 817.18) 
and the University of Malaya Animal Care and Use Com- 
mittee (PAR/19/02/2013/AA[R]). 

Extraction of DNA 

Total DNA of the P. knowlesi was extracted from each 
blood sample using the QIAGEN Blood DNA Extraction 
kit (QIAGEN, Hilden, Germany). In each extraction, 
100 \A of blood was used. The extracted DNA was sus- 
pended in water to a final volume of 50 ul. 

PCR, cloning and sequencing of the PkDBPall 

The PkDBPall was amplified by nested PCR using oligo- 
nucleotide primers Pka-DBP-Fl: 5 -CGCATTTTGAA 
GGAATCCAC-3 ' and Pka-DBP-Rl: 5 -TGCTAGACT 
TACCTTCACCT-3 ' for nest 1. The primers for the nest 
2 reaction were Pka-DBP-F: 5 -TCCTCAAAAGGCGG 
TGACCATCC-3' and Pka-DBP-R: 5-ACTGGCTGCC 
TTAGATTCAACACCA-3 '. Cycling conditions for nest 
1 were as follows: 95°C for 4 mins, 30 cycles at 95°C for 
30 sees, 48°C for 30 sees, and 72°C for 90 sees, followed 
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Figure 1 Nucleotide polymorphism in the PkDBPall of P. knowlesi. Sliding window plot of number of polymorphic sites (S) along the 
PkDBPall, generated by using DnaSP ver. 5.10.00 with a window length of 100 bp and step size of 25 bp. 



by a 10 min extension at 72°G The amplification for 
nest 2 was performed using the following cycling profile: 
95°C for 4 mins, 30 cycles at 95°C for 30 sees, 56°C for 
30 sees, and 72°C for 90 sees, followed by a 10 min ex- 
tension at 72°C The PCR product with an expected size 
of 1053 bp was analyzed on a 1% agarose gel 



Purification of PCR products and DNA cloning 

PCR products were purified by QIAquick PCR purifica- 
tion Kit (QIAGEN, Hilden, Germany) following the 
manufacturers instructions. The purified PCR products 
were then ligated into cloning vector pGEM-T @ (Pro- 
mega Corp., USA) and transformed into Escherichia coli 
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Figure 2 Amino acid sequence polymorphism in PkDBPall. Polymorphic amino acid residues are listed for each haplotype. Amino acid 
residues identical to those of the reference sequence, strain H (haplotype 1), are marked in yellow. Monomorphic and dimorphic amino acid 
changes are marked in blue and red, respectively. Total number of sequences for each haplotype is listed in the right panel. 
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TOP10F'. Plasmids of recombinant clones harbouring 
the PkDBPall fragment were sent to a commercial la- 
boratory for DNA sequencing. To detect the possibility 
of multiple haplotypes infecting a patient or monkey, 
plasmids from 4-6 recombinant clones from each trans- 
formation mixture were sequenced. 

Analysis of PkDBPall sequences 

Alignment of 60 sequences of PkDBPall [53 of human 
origin, 6 of monkey origin, 1 reference strain (strain H, 
GenBank Accession No. M90466)] were performed 
using CLUSTAL- Omega programme which was available 
on-line (http://www.ebi.ac.uk/Tools/msa/clustalo). Both 
nucleotide and the deduced amino acid sequences were 
aligned and analysed. Phylogenetic tree was constructed 
using the Neighbour Joining method described in 
MEGA5 [13]. In constructing the phylogenetic tree, 
bootstrap replicates of 1000 were used to test the ro- 
bustness of the tree. 

PkDBPall sequence polymorphism analysis 

DnaSP ver. 5.10.00 [14] was used to perform polymorphism 
analysis on the 60 PkDBPall sequences. Information such 
as the number of segregating sites (S), haplotype diversity 
(Hd), nucleotide diversity (tt), and average number of pair- 
wise nucleotide differences within the population (K) were 
generated. The tt was also calculated on a sliding window 
of 100 bases, with a step size of 25 bp to estimate the step- 
wise diversity across PkDBPall. The rates of synonymous 
(7<s) and non-synonymous (Kn) substitutions were esti- 
mated and compared by the Z-test (P < 0.05) in MEGA5 
using the Nei and Gojoboris method [15] with the Jukes 
and Cantor correction. In the case of purifying (negative) 
selection, mutations are usually not advantageous so that 
Kn will be less than Ks (Kn/Ks < 1). However, in positive se- 
lection, non-synonymous mutations can be advantageous 
and Kn will exceed Ks (Kn/Ks > 1). For testing the neutral 
theory of evolution, Tajimas D [16] and Fu and Lis D and F 
[17] tests was carried out using DnaSP 5.10.00. In the Fu 
and Lis tests, P. vivax PvDBPII (GenBank Accession No. 
M90466) was used as outgroup. 

Results 

Nested PCR amplification on the human and monkey 
blood samples isolates produced DNA fragments of 1053 
base pairs. The sequence of each fragment was trimmed to 
921 bp, based on the PkDBPall region identified by Singh 
et al [18]. This trimmed sequence encoded an amino acid 
sequence of 307 in length. A final total of 59 sequences 
(GenBank Accession No. KC597079 - KC597137) were 
obtained. 

DNA sequence analyses were conducted to determine 
nucleotide diversity and genetic differentiation. The aver- 
age number of pairwise nucleotide differences (K) for the 



PkDBPall was 11.736. The overall haplotype diversity 
(Hd) and nucleotide diversity (tt) for the 60 PkDBPall se- 
quences were 0.986 ± 0.008 and 0.013 ± 0.002, respectively. 
Detailed analysis of tt, with a sliding window plot (window 
length 100 bp, step size 25 bp), revealed diversity ranged 
from 0.003 to 0.034. The highest peak of nucleotide diver- 
sity was within nucleotide positions 600-750, whereas the 
most conserved region was within nucleotide positions 
260-360 (Figure 1). 

Analysis and comparison at the nucleotide level against 
P. knowlesi strain H as reference sequence showed muta- 
tions at 128 positions among the Peninsular Malaysia iso- 
lates. Fifty-two of these mutations were synonymous and 
76 were nonsynonymous. To determine whether natural 
selection contributed to the diversity in the PkDBPall, the 
rate of nonsynonymous (Kn) to synonymous mutations 
(Ks) was estimated. Kn (0.00952) was found lower than Ks 
(0.00278) and the Kn/Ks ratio was 0.347, suggesting that 
purifying (negative) selection may be occurring in the 
PkDBPall. Similarly, the Z test (Ks > Kn; P < 0.05) also 
indicated purifying selection on PkDBPall. In the tests of 
departure of neutrality of selection, the Tajimas D statis- 
tics was -2.085 (P <0.05), indicating expansion in 



Table 2 PkDBPall haplotypes detected in the human and 
monkey blood samples 



Blood sample 


Haplotype 


ANU 


H2 


AZL 


H9, H10, H11 


CHO 


H3, H8 


GAN 


H5, H6, H7 


HAI 


H18, H19 


HAN 


H2 


HEN 


H2 


IZA 


H2, H3, H4 


JUN 


H15 


MAD 


H24, H25, H26, H27, H28, H29 


MAH 


H2, H33 


MEL 


H12, H13, H14 


NGO 


H2 


OTH 


H2, H16 


RAU 


H2, H30, H31 


SUP 


H17 


SYA 


H2, H20 


UM0001 


H21 


UM0002 


H2, H22, H23 


UM0004 


H32 


Monkey566 


H2 


Monkey569 


H34, H35, H36 
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population size and/or purifying selection. This is further 
supported by the Fu and Lis D and F tests statistics 
(-3772 and -3756, respectively; P < 0.05). 

Comparison at the amino acid level revealed high 
polymorphism across the entire PkDBPall of the isolates 
(Figure 2). Among the 70 polymorphic sites, 66 showed 
monomorphic mutation (changed into one amino acid- 
type) and 4 showed dimorphic mutation [changed into two 
amino acid types: 221 (Q, T), 302 (N, Y), 303 (F, S), 307 
(A, I)]. The amino acid sequences could be categorised into 
36 different haplotypes (H1-H36) with haplotype 2 having 
the highest frequency (19/60). Twelve of the 20 human and 
1 monkey blood samples had mixed haplotype infections 
(Table 2). Phylogenetic tree analysis revealed that the haplo- 
types could be clustered into 2 main allele groups 
(Figure 3). 



Discussion 

The DBP of P. vivax and P. knowlesi play an essential 
role in erythrocyte invasion of the parasites by mediating 
irreversible binding with its corresponding receptor, the 
Duffy protein receptor for chemokines (DARC) on the 
surface of erythrocytes [19]. The DBP elicits strong im- 
mune responses in humans and therefore has been sug- 
gested to be an important vaccine candidate antigen 
[12,20]. The critical erythrocyte-binding motif of DBP 
has been identified to be located in region II of the pro- 
tein. The P. vivax PvDBP has been observed to have a 
high degree of genetic polymorphism, and most of the 
polymorphism is located in region II. Although the cyst- 
eine and some hydrophobic amino acid residues in 
PvDBPII are conserved in P. vivax populations from dif- 
ferent geographic regions, but the remaining amino acid 



70 



6: 
15' 

55 
22 



37 
45 



51 



68 
41. 



H1 

H2 

-H33 
-H13 
-H15 

— H17 

— H14 
H36 
-H30 

H34 
H35 
H32 



T 



H18 



"L|H22 
H12 
H16 
H31 
h— H20 
" — h 



97 



H23 

— H6 

— H7 
H5 

-H21 

— H26 
-H25 
-H27 
-H29 

33r H24 
H28 
-H19 

r H4 
H3 



89 
5C 



H8 
H9 
H10 



Allelle group I 



-H11 

— P. wVaxPvDBPII 



Allelle 



group I 



Figure 3 Phylogenetic tree of PkDBPall haplotypes. Neighbour-Joining method was used to construct the tree, which contains 36 haplotypes. 
Plasmodium vivax PvDBPII was used as outgroup. Numbers at nodes indicate percentage support of 1000 bootstrap replicates. 
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residues are highly polymorphic [6-11,21-24]. It has fur- 
ther been revealed that PvDBPII undergoes positive nat- 
ural pressure which results in allelic variation as a 
mechanism for immune evasion. On the other hand, 
despite the emerging importance of P. knowlesi as a hu- 
man pathogen, no study has thus far been carried out on 
PkDBPall. Our present study is the first to report the di- 
versity and natural selection of PkDBPall. 

The PkDBPall analysed in this study is based on the 
region defined by Singh et al. [18] using structural biol- 
ogy analysis. In their analysis, twelve C residues (posi- 
tions 16, 29, 36, 45, 99, 176, 214, 226, 231, 235, 304, 
306), which form six disulphide bridges, have been sug- 
gested to be involved in the folding of PkDBPall for 
interaction with DARC. Multiple alignment of the 60 
PkDBPall amino acid sequences (Additional file 1) in 
our study revealed that these 12 residues were highly 
conserved. Apart from these conserved C residues, the 
Y94, N95, K96, R103, L168 and 1175 residues are re- 
quired for recognition of DARC on human erythrocytes 
[18]. Again, our multiple sequence alignment showed 
high conservation of these residues in the PkDBPall. In 
fact, Y94, N95, K96 and R103 fall within the most con- 
served region in the PkDBPall (nucleotide positions 
260-360 in the gene, Figure 1). 

The PkDBPall (K = 11.736; Hd =0.986; n =0.01274) in 
Peninsular Malaysia seemed to be more diverse than 
P. vivax PvDBPII in some neighbour countries, such as 
Myanmar (K = 7.851; Hd = 0.875; tt = 0.00790), Korea (K = 
2.878; Hd = 0.775; tt = 0.00299) and Sri Lanka (Hd = 0.922; 
tt =0.00982) [7,11,25]. Generally, high levels of genetic di- 
versity in malaria parasite surface antigens are attributed to 
positive natural selection, for example, by the host immune 
system [26]. Positive selection has been reported in 
PvDBPII of isolates in Myanmar, Korea, Sri Lanka and 
Thailand [7,9,11,25]. On the contrary, the PkDBPall in 
Peninsular Malaysia was found to be under purifying (nega- 
tive) selection. The reason for this finding is unclear, but 
may be associated with the host of the parasite. For ex- 
ample, it has been reported that the rhoptry-associated pro- 
tein 1 (RAP-1) antigen of non-human primate malarial 
parasites (P. knowlesi, P. cynomolgi, P. inui and P. fieldi) 
showed evidence for negative selection, which was not 
found in two human malarial parasites (P. falciparum, P. 
vivax) [27]. Another possible reason for the purifying selec- 
tion on PkDBPall is population expansion of P. knowlesi, 
as evident by the Tajimas D statistics. In Peninsular 
Malaysia, deforestation in many areas has increased mon- 
key and forest dwelling Anopheles vector contact to human. 
The expansion of monkey and vector populations and their 
ecological niches into human habitation may therefore re- 
sult in the population expansion of P. knowlesi [28]. 

Phylogenetic studies on P. vivax isolates based on 
PvDBPII revealed the occurrence of multiple haplotypes 



in a geographical location, and these haplotypes could 
be categorized into several allele groups. For example, 
25 haplotypes from Thailand were organized into 5 
groups [9]. In Papua New Guinea, 27 haplotypes clus- 
tered into 3 dominant groups [9], whereas in Korea, 13 
haplotypes were clustered into 3 groups [7]. Our study 
revealed the occurrence of 2 distinct allele groups of 
PkDBPall in Peninsular Malaysia (Figure 3). Interest- 
ingly, allele group I was more dominant as it contained 
the majority (30/36) of the haplotypes. The minor allele 
group II had only 6 haplotypes. Two monkey blood sam- 
ples were included in this study but all haplotypes from 
the monkeys were grouped in allele group I. The refer- 
ence strain H, which was isolated in 1965 in Peninsular 
Malaysia from the first reported case of human P. know- 
lesi infection [29], was also grouped in allele group I. 
This shows that despite temporal separation of almost 
50 years between strain H and the recent isolates of this 
study, no major genetic differences had occurred in the 
PkDBPall. 

Conclusions 

Our present study is the first to report the genetic diver- 
sity and natural selection of PkDBPall. Hence, the hap- 
lotypes described in this report can be considered as 
novel. Although a high level of genetic diversity was ob- 
served, the PkDBPall appeared to be under purifying 
(negative) selection. Two allele groups of haplotype were 
obtained. However, the distribution was skewed, in 
which majority of the haplotypes clustered in a large 
dominant group. Our study was carried out on Peninsu- 
lar Malaysia isolates only. Therefore, future study should 
investigate PkDBPall of P. knowlesi from Borneo, which 
hitherto has recorded the highest number of human 
knowlesi malaria. 

Additional file 



Additional file 1: Full amino acid sequence alignment of PkDBPall. 

Amino acid residues identical to those of the reference sequence (strain 
H), are indicated by dots. The twelve conserved cysteine (C) residues are 
marked in yellow. The conserved Y94, N95, K96, R103, L168 and 1175 
residues required for recognition of DARC on human erythrocytes are 
highlighted in green. 
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